PyDigger - unearthing stuff about Python


Name Version Summary Date
nahiarhdNLP 1.2.4 Advanced Indonesian Natural Language Processing Library 2025-07-24 09:27:10
contextgem 0.12.0 Effortless LLM extraction from documents 2025-07-23 19:05:46
simple-anonymizer 0.1.18 Privacy-first text anonymization tool with enterprise-grade accuracy for removing PII from documents 2025-07-23 07:51:26
kallia 0.1.3 Semantic Document Processing Library 2025-07-20 07:41:41
fleetfluid 0.1.2 AI Agent Functions for ETL Processing 2025-07-19 19:31:12
chonkie 1.1.1 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense chunking library 2025-07-18 05:08:19
splurge-tools 0.2.6 Python tools for data type handling and validation 2025-07-12 20:27:52
profanex 0.0.2 None 2025-07-12 19:54:01
html-to-markdown 1.8.0 A modern, type-safe Python library for converting HTML to Markdown with comprehensive tag support and customizable options 2025-07-12 10:22:42
rs-bpe 0.1.0 A ridiculously fast Python BPE (Byte Pair Encoder) implementation written in Rust 2025-03-19 05:58:24
reliq 0.0.33 Python ctypes bindings for reliq 2025-02-26 19:51:44
smoothtext 0.3.0 A Python library for text readability analysis, supporting multiple languages. 2025-02-16 21:50:05
dom-tree-sitter-language-pack 0.4.0 Extensive Language Pack for Tree-Sitter 2025-02-15 08:15:55
PyTokenCounter 1.6.4 A Python library for tokenizing text and counting tokens using various encoding schemes. 2025-02-03 23:06:07
ts-tokenizer 0.1.19 TS Tokenizer is a hybrid (lexicon-based and rule-based) tokenizer designed specifically for tokenizing Turkish texts. 2025-01-30 19:59:44
tikara 0.1.5 The metadata and text content extractor for almost every file type. 2025-01-26 23:33:40
safwaText 0.1.0 A Python package for Arabic text preprocessing, including cleaning, normalization, stemming, and stopword removal. 2025-01-24 16:07:06
long2short 0.1.3 A flexible text summarization library to summarize long documents supporting multiple LLM providers 2025-01-23 11:13:46
indoxMiner 0.1.4 Indox Data Extraction 2024-12-29 09:52:42
yurenizer 0.2.2 A library for standardizing terms with spelling variations using a synonym dictionary. 2024-12-08 08:03:52